Search results for "Reference genome"

showing 10 items of 27 documents

Chloroplast genomes of Rubiaceae: Comparative genomics and molecular phylogeny in subfamily Ixoroideae.

2020

In Rubiaceae phylogenetics, the number of markers often proved a limitation with authors failing to provide well-supported trees at tribal and generic levels. A robust phylogeny is a prerequisite to study the evolutionary patterns of traits at different taxonomic levels. Advances in next-generation sequencing technologies have revolutionized biology by providing, at reduced cost, huge amounts of data for an increased number of species. Due to their highly conserved structure, generally recombination-free, and mostly uniparental inheritance, chloroplast DNA sequences have long been used as choice markers for plant phylogeny reconstruction. The main objectives of this study are: 1) to gain in…

0106 biological sciences; 0301 basic medicine; Chloroplasts; Plant Genomes; Coffea; Rubiaceae; Plant Science; Plant Genetics; 01 natural sciences; Genome; Plant Genomics; Plastids; Genome Evolution; Phylogeny; Data Management; Multidisciplinary; Ixoroideae; Q; DNA Chloroplast; R; High-Throughput Nucleotide Sequencing; food and beverages; Phylogenetic Analysis; Genomics; Phylogenetics; Chloroplast DNA; Engineering and Technology; Medicine; Genome Plant; Research Article; Biotechnology; Genome evolution; Computer and Information Sciences; Nuclear gene; Plant Cell Biology; Science; Genomics; Bioengineering; Biology; 010603 evolutionary biology; Polymorphism Single Nucleotide; Molecular Evolution; Evolution Molecular; 03 medical and health sciences; Chloroplast Genome; Genetics; Evolutionary Systematics; Genome Chloroplast; Taxonomy; Comparative genomics; Evolutionary Biology; Biology and Life Sciences; Computational Biology; Cell Biology; Sequence Analysis DNA; Comparative Genomics; biology.organism_classification; Genome Analysis; Genomic Libraries; 030104 developmental biology; Evolutionary biology; Plant Biotechnology; Reference genome; PLoS ONE
researchProduct

One is not enough: On the effects of reference genome for the mapping and subsequent analyses of short-reads.

2020

Mapping of high-throughput sequencing (HTS) reads to a single arbitrary reference genome is a frequently used approach in microbial genomics. However, the choice of a reference may represent a source of errors that may affect subsequent analyses such as the detection of single nucleotide polymorphisms (SNPs) and phylogenetic inference. In this work, we evaluated the effect of reference choice on short-read sequence data from five clinically and epidemiologically relevant bacteria (Klebsiella pneumoniae, Legionella pneumophila, Neisseria gonorrhoeae, Pseudomonas aeruginosa and Serratia marcescens). Publicly available whole-genome assemblies encompassing the genomic diversity of these species…

Systematic error; Single Nucleotide Polymorphisms; Pathology and Laboratory Medicine; Genome; Klebsiella Pneumoniae; Database and Informatics Methods; Data sequences; Klebsiella; Medicine and Health Sciences; Biology (General); Clade; Phylogeny; Data Management; Ecology; Phylogenetic tree; Bacterial Genomics; Microbial Genetics; Chromosome Mapping; High-Throughput Nucleotide Sequencing; Phylogenetic Analysis; Genomics; Bacterial Pathogens; Phylogenetics; Legionella Pneumophila; Computational Theory and Mathematics; Medical Microbiology; Modeling and Simulation; Pathogens; Sequence Analysis; Research Article; Computer and Information Sciences; Bioinformatics; QH301-705.5; Legionella; Sequence alignment; Single-nucleotide polymorphism; Genomics; Computational biology; Microbial Genomics; Biology; Research and Analysis Methods; Polymorphism Single Nucleotide; Microbiology; Cellular and Molecular Neuroscience; Phylogenetics; Genetics; SNP; Bacterial Genetics; Evolutionary Systematics; Molecular Biology; Microbial Pathogens; Ecology Evolution Behavior and Systematics; Taxonomy; Evolutionary Biology; Bacteria; Organisms; Biology and Life Sciences; Bacteriology; Sequence Alignment; Genome Bacterial; Reference genome; PLoS Computational Biology
researchProduct

Non-Redundant tRNA Reference Sequences for Deep Sequencing Analysis of tRNA Abundance and Epitranscriptomic RNA Modifications

2021

Analysis of RNA by deep-sequencing approaches has found widespread application in modern biology. In addition to measurements of RNA abundance under various physiological conditions, such techniques are now widely used for mapping and quantification of RNA modifications. Transfer RNA (tRNA) molecules are among the frequent targets of such investigation, since they contain multiple modified residues. However, the major challenge in tRNA examination is related to a large number of duplicated and point-mutated genes encoding those RNA molecules. Moreover, the existence of multiple isoacceptors/isodecoders complicates both the analysis and read mapping. Existing databases for tRNA sequencing pr…

0301 basic medicine; lcsh:QH426-470; ved/biology.organism_classification_rank.species; Computational biology; Biology; 01 natural sciences; Article; Deep sequencing; deep sequencing; 03 medical and health sciences; RNA modifications; RNA Transfer; epitranscriptome; [SDV.BBM.GTP]Life Sciences [q-bio]/Biochemistry Molecular Biology/Genomics [q-bio.GN]; Escherichia coli; Genetics; Model organism; tRNA; Gene; ComputingMilieux_MISCELLANEOUS; Genetics (clinical); Sequence Analysis RNA; 010405 organic chemistry; ved/biology; reference sequence; High-Throughput Nucleotide Sequencing; RNA; [SDV.BBM.BM]Life Sciences [q-bio]/Biochemistry Molecular Biology/Molecular biology; quantification; 0104 chemical sciences; lcsh:Genetics; RNA Bacterial; 030104 developmental biology; Transfer RNA; Databases Nucleic Acid; tRNA pool; Bacillus subtilis; Reference genome; Genes
researchProduct

MetaCache: context-aware classification of metagenomic reads using minhashing.

2017

Motivation: Metagenomic shotgun sequencing studies are becoming increasingly popular, with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification, i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes, corresponding software tools suffer from either long runtimes, large memory requirements or low accuracy. Results: We introduce MetaCache, a novel software for read classification using the big data technique minhashing. Our…

0301 basic medicine; Statistics and Probability; Computer science; Sequence analysis; Context (language use); Biochemistry; Genome; 03 medical and health sciences; chemistry.chemical_compound; 0302 clinical medicine; RefSeq; Humans; Molecular Biology; Information retrieval; Shotgun sequencing; High-Throughput Nucleotide Sequencing; Sequence Analysis DNA; Computer Science Applications; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; chemistry; Metagenomics; Metagenomics; 030217 neurology & neurosurgery; DNA; Algorithms; Software; Reference genome; Bioinformatics (Oxford, England)
researchProduct

Mycobacterium tuberculosis complex lineage 5 exhibits high levels of within-lineage genomic diversity and differing gene content compared to the type …

2020

Pathogens of the Mycobacterium tuberculosis complex (MTBC) are considered monomorphic, with little gene content variation between strains. Nevertheless, several genotypic and phenotypic factors separate the different MTBC lineages (L), especially L5 and L6 (traditionally termed Mycobacterium africanum), from each other. However, genome variability and gene content especially of L5 and L6 strains have not been fully explored and may be potentially important for pathobiology and current approaches for genomic analysis of MTBC isolates, including transmission studies. We compared the genomes of 358 L5 clinical isolates (including 3 completed genomes and 355 Illumina WGS (whole genome seque…

Genetics; 0303 health sciences; Lineage (genetic); 030306 microbiology; Sequence assembly; Single-nucleotide polymorphism; Biology; biology.organism_classification; Genome; 3. Good health; 03 medical and health sciences; Mycobacterium tuberculosis complex; Gene; Mycobacterium africanum; 030304 developmental biology; Reference genome
researchProduct

A web application for the unspecific detection of differentially expressed DNA regions in strand-specific expression data

2015

Genomic technologies allow laboratories to produce large-scale data sets, either through the use of next-generation sequencing or microarray platforms. To explore these data sets and obtain maximum value from the data, researchers view their results alongside all the known features of a given reference genome. To study transcriptional changes that occur under a given condition, researchers search for regions of the genome that are differentially expressed between different experimental conditions. In order to identify these regions, several algorithms have been developed over the years, along with some bioinformatic platforms that enable their use. However, currently available appli…

Statistics and Probability; Sequence analysis; ADN; Genomics; Computational biology; Biology; computer.software_genre; Biochemistry; Genome; Computer Graphics; Expressió genètica; Web application; Humans; Molecular Biology; Gene; Internet; Microarray analysis techniques; business.industry; Genome Human; Gene Expression Profiling; Computational Biology; High-Throughput Nucleotide Sequencing; DNA; Genomics; Sequence Analysis DNA; Computer Science Applications; Gene expression profiling; Computational Mathematics; Genòmica; ComputingMethodologies_PATTERNRECOGNITION; Computational Theory and Mathematics; Data mining; business; computer; Algorithms; Genètica; Reference genome
researchProduct

Phylogenetic Distribution of Polysaccharide-Degrading Enzymes in Marine Bacteria

2021

Deconstruction is an essential step of conversion of polysaccharides, and polysaccharide-degrading enzymes play a key role in this process. Although there is recent progress in the identification of these enzymes, the diversity and phylogenetic distribution of these enzymes in marine microorganisms remain largely unknown, hindering our understanding of the ecological roles of marine microorganisms in the ocean carbon cycle. Here, we studied the phylogenetic distribution of nine types of polysaccharide-degrading enzymes in marine bacterial genomes. First, we manually compiled a reference sequence database containing 961 experimentally verified enzymes. With this reference database, we annota…

Microbiology (medical); ecological differentiation; Phylogenetic tree; Phylum; carbohydrate active enzymes; lcsh:QR1-502; polysaccharide-degrading enzymes; Genomics; Bacterial genome size; Cellulase; Biology; phylogeny; Microbiology; lcsh:Microbiology; Marine bacteriophage; marine bacteria; Evolutionary biology; Phylogenetics; biology.protein; genomics; Reference genome; Original Research; Frontiers in Microbiology
researchProduct

Whole genome sequencing of the black grouse (Tetrao tetrix): reference guided assembly suggests faster-Z and MHC evolution

2014

Background: The different regions of a genome do not evolve at the same rate. For example, comparative genomic studies have suggested that the sex chromosomes and the regions harbouring the immune defence genes in the Major Histocompatibility Complex (MHC) may evolve faster than other genomic regions. The advent of the next generation sequencing technologies has made it possible to study which genomic regions are evolutionarily liable to change and which are static, as well as enabling an increasing number of genome studies of non-model species. However, de novo sequencing of the whole genome of an organism remains non-trivial. In this study, we present the draft genome of the black grouse, wh…

Tetrao tetrix; Male; Genome evolution; Biology; Genome; Polymorphism Single Nucleotide; Chromosomes; Birds; Evolution Molecular; Major Histocompatibility Complex; Gene density; Genetics; Animals; Genetik; Genome size; Repetitive Sequences Nucleic Acid; Genetics; Comparative genomics; Whole genome sequencing; teeri; Genome; Computational Biology; High-Throughput Nucleotide Sequencing; Molecular Sequence Annotation; Genome project; Genomics; Evolutionary biology; Reference genome; Biotechnology; Research Article; BMC Genomics
researchProduct

Inferring heterozygosity from ancient and low coverage genomes

2016

While genetic diversity can be quantified accurately from high coverage sequencing data, it is often desirable to obtain such estimates from data with low coverage, either to save costs or because of low DNA quality, as is observed for ancient samples. Here, we introduce a method to accurately infer heterozygosity probabilistically from sequences with average coverage <1× of a single individual. The method relaxes the infinite sites assumption of previous methods, does not require a reference sequence, except for the initial alignment of the sequencing data, and takes into account both variable sequencing errors and potential postmortem damage. It is thus also applicable to …

Male; 0301 basic medicine; Heterozygote; Population; Genomics; Investigations; Biology; Genome; 03 medical and health sciences; 0302 clinical medicine; Genetics; heterozygosity; Humans; low coverage; DNA Ancient; education; Population and Evolutionary Genetics; ancient DNA; 030304 developmental biology; Genetics; Whole genome sequencing; 0303 health sciences; education.field_of_study; Genetic diversity; Base Sequence; Genome Human; Genetic Carrier Screening; Chromosome Mapping; Genetic Variation; Contrast (statistics); Coverage data; Sequence Analysis DNA; postmortem damage; Variable (computer science); Genetics Population; 030104 developmental biology; Ancient DNA; Evolutionary biology; base recalibration; Software; 030217 neurology & neurosurgery; Reference genome
researchProduct

Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing

2016

Background: In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused on specific genes, gene models can be established that can aid in the assembly of gene-rich regions, and this information can be used to compare genomes and understand functional evolution. Results: In this study, gene capture technology combined with BAC isolation and sequencing was used as an experimental approach to establish de novo gene structures without a reference genome. Probes were design…

0301 basic medicine; Chromosomes Artificial Bacterial; DNA Plant; Genomics; Biology; Maritime pine; Genome; 03 medical and health sciences; Gene capture; Genetics; Gene family; Genomic library; Gene; BAC; Gene Library; Genetics; Models Genetic; Exons; Genomics; Sequence Analysis DNA; Pinus; Introns; Gene structure; Promoter studies; 030104 developmental biology; Bioinformatic pipeline; Gene model construct; DNA microarray; Functional genomics; Genome Plant; Reference genome; Research Article; Biotechnology; BMC Genomics
researchProduct